MIRE: A Multidimensional Information Retrieval Engine for Structured Data and Text
نویسندگان
چکیده
This paper presents an original informationretrieval engine, called MIRE, for integrating structured data and text. Among other things, MIRE is designed to work in a natural and efficient way with the inherent hierarchies of structured data. While multi-dimensional access methods have originally been developed for spatial applications, they can be successfully used to index hierarchical structured data and add to an existing information-retrieval engine the capability of navigating hierarchical dimensions. To support this capability, MIRE enhances the processing algorithms of an existing multidimensional access method to avoid overflow and support for hierarchical dimensions. Compared to a search engine with multiple indexes for a different type of search, the multidimensional approach shows a significant reduction in the number of page accesses over a large document collection.
منابع مشابه
Adopting the Information Retrieval Approach for Storing and Retrieving Thai-text Structured Data
This paper describes an approach of using full-text search engine in storing and retrieving structured data in Thai language. It discusses some limitations of database management system (DBMS) in querying Thai full-text based content. These limitations can result in degrading of retrieval performance both in terms of result accuracy and system response time. Information Retrieval (IR) system or...
متن کاملUsing Text Surrounding Method to Enhance Retrieval of Online Images by Google Search Engine
Purpose: the current research aimed to compare the effectiveness of various tags and codes for retrieving images from the Google. Design/methodology: selected images with different characteristics in a registered domain were carefully studied. The exception was that special conceptual features have been apportioned for each group of images separately. In this regard, each group image surr...
متن کاملSIREn: Entity Retrieval System for the Web of Data
We present ongoing work on the Semantic Information Retrieval Engine (SIREn), an “entity retrieval system” specifically designed to meet the requirements of indexing and searching a large amount of semi-structured data, e.g. the entire Web of Data. SIREn supports efficient full text search with semi-structural queries and exhibits a concise index, constant time updates and inherits Information ...
متن کاملToward Entity Retrieval over Structured and Text Data
Many real-world applications increasingly involve both structured data and text. Hence, managing both in an efficient and integrated manner has received much attention from both the IR and database communities. To date, however, little research has been devoted to semantic issues in the integration of text and data. In this paper we introduced a problem in this realm: entity retrieval. Given da...
متن کاملThe Study on Lucene Based IETM Information Retrieval
With the intensive and large scale application of IETM in equipment integrated support, information retrieval technology becomes one of the most key technologies. This article discusses the full-text search technology and Lucene full-text retrieval engine, and combines them to develop a highperformance scalable IETM full-text retrieval system, this system can effectively deal with IETM unstruct...
متن کامل